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*A'S'>t§. cifttt, sss*h> < 7 x mm** <DmmKm&? % rabicv- ex • 7 
) 

[0014] 

r+tr>y^li, * ft , ^<©K*Ofe»K.7^-bX'7.a/W^A^7x7'ifyh* 
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„ <K - i? 9 ) 
[0018] 

rttr^^iaitfi^l^, 7"77f*^©3>f yyogjRt, 8 € * - ft^ 6> 30 
WSftS^yr^y*', g*t 5ny(f a-?fr6, 3yfy75rjfo3yei-?s 
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^-■>'f*5*'Utl4V\ HTTPU--MB, 7i7 • 777^^, -glC-O<07r 
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-fX^tiffitSilttfT'f 5, yp^J/'lt-MB, 31^, £31 * y h 7 - * £ ftg|5 
© * y h -7 - 7 P> # K f § y - h 7 x ••9--^*5^fi*<0-»K, 9: fc tt ife H * y 
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SIOftl\ feSV^iM^CEItfai^ r-f*i LT®#S ftfc, 7x7*^-^(0 r@ 
[ 0 0 3 7 ] 

5. ifOX^^Ii, JaffiWK3fffl©lii&*i*»5i:fc iS3Il':'*-*y 
h • / W y # - y n £ M £ ft £> ;P - r * > © X H * K « m 4 L & V> . X^yf L (f 
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BANDWIDTH SAVIN03 AND QoS IMPROVEMENT FOR WWW SITES 
BY CACHING STATIC AND DYNAMIC CONTENT ON A DISTRIBUTED NETWORK 
OF CACHES 

5 TECHNICAL FIELD OF THE INVENTION 

The Invention relates it> the siorage ami transport of Information on an internet and, more 
particularly, to caching Web-page content on the Internet. 

10 This is a confmuation-in-port of commonly-owned, copending U.S. Provisional Patent 
Application No. 6XV2 1 8,559, filed 1 7 Jury 2000. 

BACKGRO UND OF THE INV ENTION 

When two or more computer* are connected, so that they can share resources, this is referred 
15 to as a "network". When two or more networks are connected together, (hU is referred lo as 
an "internet" (lower case 1*). The "Internel" (upper cose T) is a a global, hiternet, which is a 
plurality of intcr-connected networks, oil using a common protocol biown as TCP/IP 
(Transmission Control Prolocol/Inlemei Protocol). 

20 The World Wide Web ("WWW, or simply "Web") is n set of standard"? and protocols that 
enable the exchange of information between computers, more particularly hypertext (HTTP) 
servers, on the Internet, tying tbera together into a vast collection of interactive multimedia 
resources. Traffic across the .Internet passes through multiple "backbone" carriers, such as 
UUNct or Digcx, which forward (he signals to one another under a *ysfeiu called peering. 

25 

"Quality of Service" (QoS) is a measure of the spend ond reliability wtih which Information 
can be transported from a content source (typically a server, or an Internet Service Provider, 
ISP) to a user's computer (upon which is typically running a type of "browsci" software tliat 
is used to view content). "Bandwidth" is a measurement of tlie volume ofurJurmalion that 
30 can be transmitted over a network (or a given portion thereof) at a given time. The higher tlx: 
bandwidth, the more data can be transported. Latency is a measure of the lime tjiat it takes 
for a packet of data to traverse the Internet, from source to user. 

Bandwidth con be increased by installing new capita) resources, but increasing bandwidth 
33 only indirectly addressee latency, and increasing bandwidth cannot accelerate overloaded or 
slow origin servers. Other solutions arc needed to overcome these barriers. One solution, 
caching content, is discussed at length ;n an article entitled "Network Caching Guide, 
Optimizing Web Content Delivery", by Michael A. Ooulde, March J999, which is available 
online (via the Internet) at the following address: 
40 rrttp:/Avww.biktpmi.com/pro^ 

A text version of the article is available online at the following address: 
http://ww.v.goc#Ie.com/$earch?q^ 

techjesowceVtcch/cachir^guidc.pdf 

45 As noted jo the CJoulde article: 

"One way tq improve performance and make it more predictable is to 
minimize the variability introduced by long trips across the Internet from the 
content source to the user's browser. By storing frequently accessed content at 
so . a location closer to the user, a great deal of latency awl impredictable delay in 
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iho Interact can be eliminated. The technique for doing this is called caching. 
C&ching is a mews of storing content objects from a Web server closer to tlx; 
user, where they can be retrieved more quickly. the storehouse of objects, 
inchiding text pages, images, and other content, is called a Web cache 
5 (pronounced cash)." (Gouldc, page 2) 

Caching is an approach ibat improves QoS far users while improving the 
economics for network service providers. Caching entails storing frequently 
accessed Web content closer lo users, thereby reducing the number of hops 
10 thai most be traversed in order retrieve content chni resides on a remote site 

In operation, requests lor content originate from a user's browser and axe first 
directed to the caching server. If the content is currently contained in the cache 
15 and is up to date, the content is sent lo the user without the browser 's request 

having to be sent on to the originating server. This can reduce the latency for 
the transfer of content and reduce the amount of time it takes to transfer tbe 
content to tbe user. It also reduces the amount of data the service provider has 
to retrieve upstream from tbe Internet to fulfill requests, ({ieulde. page 8) 

20 

Caching is a concept that is well understood in computer hardware design. 
Modem microprocessors employ on-chip caches of high-speed memory lo 
store frequently used instructions ami data locally instead of having In read 
them from slower memory. Computer* implement several additional levels of 

25 caching* including RAM cache and even cm-disk cache, a)l designed to reduce 

tlic latency for reading instructions ond data and speeding up (he transfer of 
data within the system. The basic principle behind caching is to store 
frequently accessed data in a location that can be accessed more quickly than 
it couW from the data's more permanent location. In the case of a system's 

3D CPU,, the permanent location is the disk. On the Web, the permanent location 

is the origjn server .wmewfterc out on the Internet. (Gouldc . pnge fc-9) 

Network caching, while a newer concept, is based on exactly the same 
principle and has the same benefit of cost-effectively improving the user 
35 experience. (Gouldc. page 3) 

"Caching provides direct benefits to the end user in terms of minced latency 
of Web page download (the lime you have to wait before anything starts to 
happen) and faster download speeds (the time it take* for all tbe downloading 

40 to finish). Bui caching also provides benefits to network service providers. By- 

storing, or caching, Web content that users request from remote servers 
locally, fewer of these requests have to be sent out over the lotcmet to he 
fulfilled. As a result, the access piuvider maintaining a cache has its upstream 
bandwidth requirements reduced. This reduces tho bandwidth the service 

4? provider has to purchase in order to provide its customers with a satisfactory 

Web experience. (Gouldc. page 3) 

"Caching also provides several benefits in a Web hosting environment in 
which an uecess provider hosts Web sites for marry customera. lr can reduce 
50 the impact of traffic spikes caused by high-interest content and also serve ns 
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the basis for a variety of value-added service* such as providing additional 
enpocily, guaranteed service levels, and replication service*. fgqJdc. page 3) 

"[Caching] network data makes it possible lo provide users with an optimal 
experience and predictable response lijoes- With typical cache hit rales, the 
user experience has u higher quality of service (QoS). Improved QoS provides 
sigraficant benefits to ISPs, content providers, mid corporate sites. Better QoS 
results in higher customer loyalty and retention. It helps create a stronger 
brand equity, both for the access provider and fox content providers. Content 
providers who ensure llxtf their content is amply cached throughout the 
network , close to users, will Ultimately see more visitors accessing their 
content. (Goulde. page 5) 



"Web browsers also implement n fonn of caching, storing recently a 
Web content on the user's hard <hsk and reading from that local cache affiles 
iroiead of accessing the blartet. This works weB when the user hits the 
'Back* and "Forward" buttons in their browser during a session, hut it docs 
nothing if the user is browsing a site for the first lime. (Gouldc, page 9) 

"Neither (he browser's cache nor a "Web server's cache can address network 
performance issues. By placing a cache of Web content on the network 
between the user and ibe originating Web sites, the distance that commonly 
accessed content has to travel over the Internet is reduced, and users 
experience quicker response aod faster performance. Network caching takes 
advantage of the fact that sonic content is accessed more frequently that oilier 
content. Uy implementing o caching solution, network service providers can 
provide » better Web experience to tbeir customers while reducing the total 
requirement for high-bandwidth connection* to the Internet. (Gouldc. page 9) 

"When a caching solution is not in place, requests for content from a browser 
and the content delivered from the origin server must repeatedly traverse a 
complete trip from tbe requesting computer to the computer that Iws the 
content. Tbe Web browser sends a request for a Uniform Resource Locator 
(URL) that refers to a specific Web page on a particular server on the Internee 
The request is routed to the server through the norrnaJ TCP/JP network 
transport. The content requested fro© the Web server (also known as an HTTP 
server) may be a static H'1"ML page with links to one or more additional Ifles, 
including graphics. The content may also be a dynamically created page that is 
generated from a search engine, a database query, or a Web application. The 
HTTP server returns the requested content to the Web browser one GJe at a 
time. Even □ dynamically created page often has static components that are 
combined with the dynamic content to create the final page. (Gouldc page 

"When caching is used, frequently accessed content is stored close to the user, 
ll may be stored inside a firewall ou the usert corporate LAN, at the user's 
ISP, or at some other Network Access Point (NAP) or Point of Presence (POP) 
located closer to the user (ban the majority of Web servers. If U>c requested 
content, or fijc, is located in the cache and is current, or fresh, that is 
considered a cache hit. The more frequently user requests can he served from 



(34) 



JP 2004-504681 A 2004. 2. 12 



pt-r/iuumw.M 



these caches, the hig}>cr the hit rate, the better the user 's performance, and the 
lew dutn ihe service provider has to retrieve from (Jxj Internet on behalf of the 
user. (Goulds, page 14) 

5 "Similar issues exist tor FIT lUe transfers, with nn FTP server handling each 

request for u file submitted by the FTP cliei)t application. Delays and 
bottlenecks cm) be u bigger problem with FTP because the typical size of a i'dc 
transferred by FTP is larger than a typical HTML fik. Streaming audio and 
video are additional examples of en Internet application that can benefit from 

i ft caching content close to end users. fflonlde . page 1 4) 

"Network caching can be applied to conteni delivered over many different 
protocols, Ifcese include HTTP. NNTP, FIT, RTSP, and others. AH ore 
characterized by having some proportion of static content and high utilization. . 
15 Cache server support /or each protocol is, of course, required." (page 14) 

"Since (he cncJic must be kepi fresh, there will still be traffic from the JSP out 
to the Interact, even if every bit of content requested by users is in a enche. 
Page freshness has to be assured, and new content must be downloaded. But 

20 by using caching, bandwidth utilization can be optimized, ft is even possible to 

use a cache with dynamic conleni, since even these pages have some static 
content that can be served from a cache. Depending on the distribution of 
traffic awl the scalability of the cache, up to 40 percent of user HTTP requests 
can be taken off the network ami served from a coche. This makes networks 

» more efficient, enabling better performance to be offered at lower cost." 

page 14) 

The Gould article discusses the effective use of caches, load balancing, where to locate 
caches in the infrastructure, and designing enche-niewdry web content. There is also 
30 mention of protocols which have been developed . for example, Web Cache Control Protocol 
(WCCP) (page 1 8). Ihcrc is also discussion of appropriate use of the "expire" and 'max- 
ago" headers in the HTTP protocol (fiouldc. page 27). And, as expressly stated by Gould, 

"With good design even dynamically generated pages can benefit from 
3$ caching. By separating the dynamic content of a page from the sialic content, 

the static content can be cached and the dynamic conteni retrieved and 
downloaded separately. (Gonlde. na&c 28: emphasis supplied) 

It is therefore known to cache static data from Web sires using a dedicated computer's RAM 
•ift and hard disk, so that this computer can act as a proxy between the WWW (■web") servers 
nod their users. Several requests Tor a given Web page can then be served el (ho cost of a 
single request to the original Web server. Internet Servico Providers (ISPs) commonly 
provide soc)> a caching servico to their customers. However, this technique suffers from two 
main drawbacks: 

is - it is not applied globally, and even users of the Web sites who have access to a cache 

have to deliberately activate this feature, of which they are often not aware; and 

- it does not apply to pages which change very often, as is the case wilb dynamic 
conleni (content which is generated on-thc-fly by the remote server). 



» There therefore exists a need ftw the caching of dynamic content, as well as static content. 
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GloisriTy/Dtffniftotu 

Unkss otiwfwise noted, or as may be cvideni from the context of their usage, any terras, 
abbreviations, acronyms or scientific symbols ami notations used herein arc to be given their 
ordinary meaning in the technical discipline lo which the invention most nearly pertains. The 
following ylossajy of terms Is intended to lend clarity and consistency to Ihe vrjrious descriptions 
contained herein, as well st> in any prior njf documents which may be cited: 



to Cache Server A highly optimized application that stores frequently accessed conient 

at strategic aggregation points close to the users requesting that content 
in order to reduce the impact of delays and network bottlenecks. 

CARP Cache Array Koutinrj Protocol A protocol for synchronizing 

JS multiple cache servers. CARP maintains a shared namespace that maps 

any Web object** address (URL) to only one node in the array. 
Requests are routed to that node. 

Cookie The most common meaning of , *Cookie" on ihe Internet refers to a 

2tr piece of information sent by a Web Server to a Web Browser that the 

Browser software is expected to save and to send back to the Server 
whenever the browser makes additional requests from tbe Server. 
Depending on the type of Cookie usod, and tie Drowser's settings, the 
Browser may accept or not accept the Cookie, and may save the 
25 Cookie for cither a short time or a long time CnoJcies might contain 

information such as login or registration information, online "shopping 
cart" mformutinn, user preferences, etc.. When a Server receives t» 
request from a Browser that includes a Cookie, the Server is able to use 
the information stored in the Cookie. For example, the Server might 
30 customize what is sent back 10 the user, or keep 3 log of particular 

nscr'.i requests. Cookies are usually set to expire after a predeicnnincj 
amount of time and are usually saved tn memory until the Browser 
software is dosed down, at which time they may be saved to disk if 
their 'Yxpirc time" has not been reached. 

35 

Dynamic Content "Live" content which is updated on a regulur basis. Examples of 
dynamic content might include a "current temperature" display 00 a 
weather web site, search results, or a "Current Top Headlines" item on 
a news web site 

.10 

HTTP Server A server ihat implements the HTTP protocol, enabling it to serve Web 

pages to client agents (browsers). HTTP Servers support interlaces SO 
that Web pages can call external programs. They also support 
encryption mechanisms for securely exchanging information und 
45 aulbertticiuioB and access control mechanisms to control access to 

content 

ICP Internet Cache Protocol. A protocol for synchronizing multiple cache 

servers. Each lime a enche server experiences a miss, it broadcasts 
50 messages 10 all peer nodes raking who her each has ihe con lent. The" 
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requesting server then must issue a request far the content and forward 
it oo to the user. 

Proxy Server A prosy server acts as an intermediary between a user :wd the Internet 

so on enterprise con ensure security and administrative control and nljo 
provide a cacbiog service. A proxy server Is usually associated with or 
psrt of 3 gateway server thai separates the enterprise network from the 
outside network or a firewall that prelects (he enterprise network from 
outside intrusion. 



Static Content 



HTML 



35 MIT 



IP Address 



Protocol 



Routers are Ibc device* that build a fully interconnected network out of 
a collection of po>n(-t«-point links. Reuters on (he internet exchange 
biformation pertaining »o their local section of the network, particularly 
how close they are topologfcaJiy to local systems. They collectively 
build a map of how to gel from any point in the Internet to any other. 
Packets arc routed based oo the exchanged mapping information, until 
the lust router connects directly to (he target system. 

"Fixed" or long-term unchanging component* of web pages stored as. 
files that lire eitlier never changed or arc changed only on an infrequent 
basis. 

High-speed network devices (but typically sit on the periphery of the 
Internet. Switches differ fruni routers in providing higher performance 
at h Jower price bur with limited functionality. Typical switcljcs can 
route traffic locally but aren't concerned with complexities of routing 
found in the high-speed Internet backbone. Switches play an important 
role io caching because they arc often used !p divert the cnvbciblc 
traffic to the caching system. 

Hypertext Markup 1 Jinguagc A specification based on Standard 
Generalized Markup Language (SGML) for tagging text so thai it may 
be displayed in n user agent (browser) in a standard way. 

Hypertext Transmission Protocol. An appjicat too-level protocol (hut 
runs on. lop of TCP/IP, which is the foundation for the World Wide 
Web. 

Internet Protocol, 'ibe network layer for the TCP/IP protocol suite. It is 
a connectionless, best-effort, packet -switching protocol. 

A 32-bit address defined by the Interact Protocol that b usually 
represented in dccimuJ notation. It uniquely identifies each computer 
on (he Internet. 

An agreed-upon set of technic*) rules by which computers exchange 
information. 



URI. 

jo 



Uniform Resource Locator. The method by which Internet sites are 
addressed. It includes an access proloco) and cither an IP address or 
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DNS name. An example is http://wwv»Aloni«in.w«n. 

Short for User's Network. A collection of lens of thousands of bulletin 
boards residing on the Internet, Each contains discussion grwrps wlJcd 
newsgroups dedicated to various topics. Messages arc posted ami 
responded to over the Network News Tninsler Froiocol (NNTP). 



Web Server See HTTP Server. 
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OftlEF DESCWriON OF TB O INVENTION 

An object of tbc invention to provide a technique for ratacfog bandwidth usage of WWW 
servers and improving the QoS of WWW sites. 

3 

Acceding to the invention, a technique for caching objects haviog dynamic contem on the- 
Internet generality comprises disposing a cache in the Internet Tor storing und updating copies 
Of dynamic content. The cache may he disposed at a locution selected from the group 
consisting of major Internet switching locations, dial-in aggregation points, and corporate 
1 0 gateways. The cache may also st ore and update copies of st atic content. 

According to a feature of the invention, irpdue characteristics of the objects are determined, 
and a tin* to live (TTL) parameter for the objects is adjusted based upon the update 
characteristics. Generally, the object is updated if its 'fTL is less than its ngc. The TTJ, for 
15 an objcci may be adjusted to: 

(i) maintain its probability of error below a predetermined error probability tlueshokt; 
(ji) maintain its error rate below a predetermined error probability threshold; or 
<iii) maintain its delay time below » predetermined delay Ihroshold. 

20 According to the invention, n method eff responding to a user request for an object having 
dynamic content, said object originating from a server, comprises storing a copy of the object 
in a cache; establishing a time to live (TTL) for the object; receiving the user request at the 
cache; 

fulfilling the user request with the stored copy of the object if its TTL is greater than its age; 
25 and fetching an updated copy of the object and responding to the user request with the 
updated copy if the TTL of (!*> stored copy is less thon its age. 

According to a feature of the invention, the TTL for the object is first set to o reasonable 
lower limit (Tmin) and is than adjusted based on the frequency at which the object actually 
30 changes. 

According to a feature of the invention, each time the cache fetches tine object from the 
server, tlx: cache performs tho following procedures: 

a. if another fetch for the same object is ongoing, waiting for the previous fetch to 
35 complete, 

b. fetching (he object from the server, 

c. replacing the cached copy, if present, by the fetched object, alter having compared 
thero to determine whether the object had changed since it was last felclicd; 

d. initializing or updating the object's change statistics accordingly; 

40 e. marking the object as static or dynamic eootcnt depending on the server's reply; and 

f. it" the object a dynamic, setting its TTL (T) to an appropriate value with respect to 
an average time between change? (x) determined from the object's change statistics, the 
number of user requests per lime unit (q) determined from the objects access statistics, and 
one of ihc following procedures (A-E): 

45 A. maximum error probability; 

B. maximum error rate; 

C. maximum delay; 

P. any combination of the above (A.B.C), taking the lowest 7"; or 
£. any combination of the above (A,B£), but keeping T within a predetermined 
50 window of "reasonable" values bounded by Tiniii and 7m ax. 
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Preferably, the caches are dedicated machines ond are placed so thai Web browsing passes 
tlirough Ihe cache instead of going all the way lo (he original sites, in many different 
locations, ideally wilhin Ihc network oflSPs providing the Internet connectivity to the highest 
5 number of owrs in those locations. In (his mnnncn 

• Ihc users of those ISPs and, to n lesser extent, neighboring ISPs, will enjuy a huge 
QoS and speed improvement, for most of the traffic will stay within or clow to the ISPs' 
internal octwo&s and not Deed to go through Ihc highly-Mcd Internd backbone; and 

• the original web-sites will no longer need as much bandwidth, since the caches will 
10 absorb most of the load. 

Since marry websites tend to serve ail-dynamic content, dynamic content wiJI be cached, its 
well as static content. However, K is not merely a matter of the caches remembering the 
content, as there is do imlfcution as to when the Jatter is going to change, Hence, it needs to 
l.s be reloaded, periodically, instead of simply serving Ihc cached copy. 

Therefore, the cache reloads a page (fetches an object from the server), whenever its 
corresponding cached copy has not been rcrrcslied for a given time (time to live, TTL"). 
TTL can be thought of as the "shelf life" of the page. The cache first, sets Ihe TTL for n 

20 dynamic object to a reasonable lower limit X**. Then, over time, as the cache reloads the 
page several times, it can track the frequency at which the page actnalry changes content, and 
adjust the TTL for that page accordingly. This m TtL* teclinique mimics a common caching 
method for static pages, if the original .server of Ihc page specifics a TTL. But since servers 
for dynamic pages do not specify a TTL for them, the cache has lo establish a reasonable 

25 TTL of its own accord. 

Therefore: 

- whole web- sites can be cached, including their dynamic content; 

- traffic between caches and the original sites is better managed ; 

30 - while some users might get an outdated content, in the case of a page changing 

before the cache reloads it, it is possible to limit the delay between the two (which could 
happen anyway witliout the cache, when the network gets saturated) as well as ihe error 
probability; 

- the system can adapt, in real time, according to the number of requests to each page 
35 and the actual update frequency of the page. 

Other objects, features and advantages of the invention will become apparent in light of the 
following description thereof. 



40 
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BP 1EF DESCR IPTIO N OF THE DRAWINGS 

Reference will be omde in detail to preferred embodiments of the invention, exnraples of which 
may be illustrated in the accompanying drawing figures.. The figure* we inlcndcd to be 
illustrative, n\A limiting. Although the invention i? generally described in the context of these 
5 preferred embodiments ii should he understood that it is run intended to limit the spirit and 
scope uf the invention to these particular embodiment;; 

In flowcharts presented herein, rectangular boxes generally represent a seqt*ntial step being 
performed, a generally rectangular bo* with pointed ends represents a decision step (lea) 
to having two mnhjally-exchisjve results ( w Y"«Ye<r, >T-No), and an empty circle is not a step or 
a test, but is merely n graphical junction point At which two or more paths in U>e flowchart 
converge. 

The structure, operation, and advantages or the piesent prelerred embodiment of the invention 
IS will become further apparent upon consideration of the following description* take;) in 
conjunction with (he accompanying figures, wherein: 

Figure 1 is a greatly amplified schematic ilhwtraiion of the Internet, illustrating an 
embodiment of the caching system of the present invention; 

20 

Jlgnr* 2 is a flowchart illustrating how user requests for static and/or dynamic content are 
handled by the cache, according to the invention; 

Figure 3 is a graph illustrating an average error probability, according to an analysis of the 
25 invention; and 

yignre 4 is a graph illustrating the evolution of error probability, according to an analysis of 
the invention. 
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DKTAH JP.D DESCRIPTION OF TUF. INVENTION 

Figure 1 is ft greatly simplified schematic illustration of the Internet environment, illustrating 
an embodiment of the caching system of the present invention. Generally, a user 102 /writes 

5 » "request" Tor tin "object'* (eg., a Web page) winch is made available on the Internet by a 
server (e.g., ISP) 1 04. (Tlw object typically originates at a content provide!, not shown.) A 
switch 106 interfaces a cache (cache server) 1 08 to the Internet. The cache may con tab a 
copy of the Web page There me two possible ' responses" to the user request - cither the 
server "serves'* (or "services*") the request, or it is -fulfilled" in the cache. Jn the latter case, 

10 the content must first have been "transferred" to tbo cache, which may periodical ty "fetch" (or 
"reload") updated Web page content from the server. These terms will be adhered to, to the 
extent possible, in the discussion that follows. 

As discussed in the Goukfe article (e.g. F page IS, Illustration 4), the switch 106 may be a 
JS. Mgh-perforrnancc processor that can look ol network traffic ond ma ke tou ting decisions based 
on protocols above the IP level A* a result, the switch can direct HTTP (and other) traffic to 
caches (108), «nd send the rest of the traffic directly to the Internet. This is exemplary of 
one of a number of possible system architectures. Which architecture is used depends ou 
Revere! factors, including where foe cache is implemented, the primary purpose of the cache, 
20 and the nature of the traffic 

As further discussed in the Goujde Article: 

"Caches can either be deployed in a transparent or nontransparem form. A 
15 non transparent cache is explicitly visible, sind browsers or other caches that 

use the cache are overtly configured to direct tralfic to the cache. In this case, 
Ibc enche acts as a proxy agent for the browser, fulfilling requests when 
possible and forwarding requests to the origin server when necessary. 
Nratrantparent caches are often a component of a larger proxy server acting 
30 as part of a gateway or firewall and addressing many different applications. 

fOonldc . oa2es 1617t 

A transparent cache sits in the network Dow and functions invisibly to a 
browser. Fur ISP and enterprise backbone operations, u bnnsparent 
35 configuration is preferred because it minimizes the total administrative and 

support burden of supporting users in configuring their browsers to find the 
cache. ( Gouldc. paae 17) 

Caches should be implemented transparently to maximize the benefits of 
40 caching. A uonlronsparcnt implementation requires having browsers manually 

configured to direct their requests for content to the cache server. In a 
transparent configuration, cache bench ts are delivered to clients without 
having to reconfigure the browser. Users automatically gain the benefits of 
caching. (Goulde . page 17) 

45 

In an enterprise environment, transparency can be implemented cither through 
automatic browser configuration or by intercepting traffic on the network. 
Both "Netscape and Microsoft provide utilities for centrally configuring large 
networks of browsers and for customizing browser configurations being 
50 installed on users' PCs. Browser plug-ins can also provide automatic 
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configuroUun. Althougb (his approach is transparent to (he user, il does 
retjtrin: admuiislrative effort on an ongoing basis. tCjp.trfde. pnge 1 7) 

There arc several options for providing Iranspareni caching: 

- The cache can be configured as if il were a router so lh»l all Internet-based 
traffic is aimed at it. This is a transparent configuration Ibul requires no 
configuration of the browser; the browser or downstream cache is unaware of 
the cache's existence but still benefits from il, The downside is that the system 
on which lite cr>che reside has to devole som« of ils resources to routing, and 
the each* becomes a mission-critical part of the network. Sophisticated router 
configuration with policy-based routing can minimize some of these issues by 
only directing HTTP (TCP Port 80) traffic to the cache, bypassing the cache in 
the event of failure and sending traffic directly to the Internet. ( Gouldc . pngc 
17) 

- An incrcasingh/ popular option is to nse a Layer 4 switch to interlace the 
caclie to fbe Internet (sec illustration 4). these switches, currently offered by 
Alteon, Foundry, ArrowPoint, und others, are higlr-performancc processors 
that can look at network traffic und make routing decisions based on protocols 
above the IP leveL At a restdt, the switch can direct HTTP (and other) trattlc 
to the caches and send the rest of the traffic directly to the Internet ... [fjhc 
switch can parse the HTTP request and send the request to a specific node in a 
cache form based on the URL requested. Using an intelligent swiich keeps 
iinnccessary network traffic off the cache, simplifies designing for availability, 
and distributes loading on the cache farm based on specific URLs. (flpulde. 
page 18) (An architecture similar to this one is described hereinabove wilb 
respect to figure 1) 

- Another option for transparency is lb© Web Cache Control Protocol 
(WCCP). WCCP was developed by Cisco Systems to allow Web caches to be 
transparently installed in networks using Cisco IOS-b*sed routers. With 
WCCP, HTTP traffic Is redirected to the Weh cache instead of the origin 
server. WCCP does not require any changes to the network architecture, 
thereby simplify tng the implementation of transparent Web caching. (Oouldc. 
page 18) 

- Web Proxy Autodiscovcry Protocol (WPAD) is a new proposed stmdnrd 
protocol* which, when integrated with browsers, streaming media chcnls, and 
Other Internet client software, it designed lo automatically locate caches and 
services on th« network without requiring any configuration by end users. 
WPAD provides a flexible, vendor-ncultBl so fl wive alternative to existing 
cache traa<n>arcncy solutions that utilize routing or switching equipment. In 
the fnturc, WPAD-cnablcd client software will automatically connect vu*rs 
wilb embedded network services in their region, providing simplicity for boib 
users and the network providers that deploy these services. (Ooulde . rages 
18-1?) 

Caching systems can be used to optimize the performance of a Web server site 
as Weil os to speed Internel access for Web browser users. In a reverse proxy 
configuration, the caching system aits in fjonl of one or more Web servos, 
intercepting iniffiv to (hose servers and standing in, or proxying, for one or 
more of die servers. Cache servers can he deployed throughout a network, 
creating a distributed network for hosted content. When necessary the proxy 
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cache server will request dynamic and other short-lived content from the 
ori^n servers. This Enables content from »bc Rite to be served frora a local 
cache instead of from tiw origin stiver. The proxy server con be optimized for 
high vdfornwnce, efficient operation, conserving resources, rod off-loading 
5 the origin server from serving static content. Reverse proxy caching provides 

benefits to tbc access provider as weU as to the user Those benefits include 
the ability to enable load balancing, provide pea*- demand insurance to assure 
availability, and provide dynamic mirroring of content for high availability. 
(Gouldc. page 20) 

to 

There ore three general ch-TroctcrisiiM that describe where caches are best 
located on a network: 

- Choke Points in the Network Traffic There are locations where a large 
majority of network traffic posses and is therefore visible to the cache server. 

15 This allow the cache to handle more requests and store more content tlwn ir 

located somewhere Uiat can be eusily bypassed. 

. Points with High-iNclwork Load. High traffic conditions allow higher 
cache utilization mxl therefore greater bcnetUs can be achieved. 

- Locations that Produce Greatest economic Benefits for a Gicbc. Points 
2t> where risers will benefit from high each* hit intes whiJe also reducing 

upstream bandwidth requirements will provide both QoS benefits and 
economics lor tbc access provider, locations with these characteristics are 
typically found at major Internet ."twitching locations, dial-in aggregation 
poiots, or corporate gateways. (Gouldc. page 20) 

25 

Locations with these characteristics are typically found at major Internet 
switching locations dial- in aggregation points, or corporate gateways, 
including: 

POP and DIAL-UP (sec Oould c page 2), Illustration 7) 
30 "NAPS ( see GouMc. page ZZ. Illustration 8) 

Satellite (sec Gouldc. page 22) 

International Gateway (see Gouldc . page 23, Illustration 9) 
Web hosting (see Gouhle, page 24, Illustration 10) 
Last Mile (see Oouldc. page 25, Illustration 1 1) 

3 5 Corporate Gateways (sec Gouldc. page 25, til ustration 1 2) 

A person having ordinary skill in the art will readily understand where to locale the cachc(s) 
of the present invention in fight of the description set forth hereinabove. Generally speaking, 
the cache of the present invention can be located anywhere that there is (or could be) a cache 
40 serving stotio content, or it can be incorporated into an existing cache which fulfilb requests 
for stotic content, with the additional functionality enabled according to the techniques set 
forth below. Or, it con be provided as a separate, dedicated machine (computer). 

Figure 2 is a flowchart {frustrating how user requests for static and/or dynamic content are 

4 5 hand led by the cache 

In a first step 202, for a user request for an object, the cache determines whether the 
requested object is in cache. If not (N), the user request is passed on to the server for 
servicing the request and meanwhile, in a step 204, the cache fetches the object from die 
5V server in anticipation of the next request for Oic object from the same or another user. 
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Generally, in this example, nil requests Tor object? ore presumed to go through a cache server, 
which is Irausparent to the oxer. II intercepts information requests and decides whether it wil) 
proviiJe a response from a cached local copy or from o remote ^formation source. After 

s fetching information from a local source, the each© server decides whether to store it locally, 
and if so, for how long. A request for information which can he provided from a local copy 
is known as u "cache hit". Conversely, a request for information which is not stored locally is 
known o "cache miss". When I he stomge' determination algorithm is wclt-dcsigned the 
probability of a cache hit is grontly improved, and apparent response time to user requests 

10 (related to QoS) is reduced. Funher, every Information request satisfied by locally cached 
content (cache hit) reduces traffic on the external network, perrnittinj- shorter response times 
over the external network. 

If the requested object is in the cache, it is aexl determined in a step 206 whether the 
15 requested object is marked as static. If so (Y), it is tben tlelcrmiawl in a ttep 208 whether to 
update ihe cached copy or to ose it to fulfill the user request using any suitable standard 
algorithm f« caching static objects, soob as comparing tiie objects '"age" (the tune elapsed 
since it has last been refreshed) to the TTL (if the original server of the page specifies a 
TTL), asking the server the latest modification tune, etc). 

If the requested object is in cache, and it is dynamic (N, step 206), it is determined iu a step 
21 0 whether Ihe cached copy's TTL ("shelf life") is less than its age. 

If the cached copy's TTL is less than (a lower number than) its age (Y, step 210), it is 
25 considered ro be "stale", and in a step 2 12 the cache: 

- updates toe object's access statistics (number of user requests per time 
period - bud tew minutes, last hour, etc.); and 

- fetches the object from the original server. 

5ft If the cached copy's TTL is equal to or greater than its age (N, step 2 1 0), it is considered to be 
"fresh", and in a step 21 4 the cache: 

fulfills Ihe request using the caclwd copy; und 

- updates the ohject's access statistics. 

35 Optionally, if the lime difference between the cached copy's age and its TTL is less than a 
given time, and the number of recent user requests is more than a given rate, it is considered 
to be "tipped" and "popular", and the cache fetches the object from the server in what is termed 
an "anticipated refresh". 

40 AoVlirionally, fm each time Ihe cache fetches an object from the server (sec steps 204 and 
212), the following procedures are performed by the cache, in a step 216: 

a. if another fetch for the same object is ongoing (e.g. due to a previous user request), 
the cache warts for the previous fetch to complete, rather than duplicating it request; 
- b. fetches the object from the server, 
*s c, replaces its cached copy, if present v by the op- to -dare object } after having compared 

them to determine whether the object had changed since it was last frfched; 

d. initialize or «pd?tc ihe object's change statistics {number of changes per time 
period) accordingly; 

e. mark the object as static or dynamic content depending on the original server's 
50 reply (presence of o modification time, or of a nonzero TTL, etc.); • 
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f. if the object is dynamic set its TTL (7) to the appropriate VJlue with respect to the 
overage time between changes x (dtiemuncd from the object's change statistics), the number 
of user requests per time unit ri (determined from the objects access statistics), ami a selected 
one of the following p/ocemires (A-E): 

A» maximum error probability ( X w, uch is the average ratio of the number of 
requests fulfilled using a cached copy whow cmesponding original object has clanged for 
more than a given time )Y, over the total number of requests: 



fi. maximum cttot rate ( no ), which is the overage number per time unit of requests 
fulfilled using a cached copy whose corresponding original object has changed for more Than 
a given time >K 



-('* -s 



n J 



maximum delay ( D 9 % wluch is the average time between an object change and 
when the cached copy is refreshed: 

„ CD, 



a any combination of the above (AJi.C), laking the lowest 7} or 

30 

E, any combination of the above (A,B,C). but keeping T within a jpredetcrmined 
window of "reasonable" values bounded by 7'mm and Jinax. 

In the above equations, the following parameters have been used: 
35 7 is Time To I .ive for the dynam ic object; 

IV is a given time since the original object lias changed (i.e., bow long H is outdated); 
x is on average time between changes, which is determined from the object's change 
statistics; 

n is number of user requests per time unit (e.g., frequency); 
40 po is maximum error probability, which is the average ratio of tbc number of requests 
fulfilled using a cached copy whose corresponding original object lias changed for 
more than the given time fV 7 over the total number of requests; 
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Tfo is maximum error rate, which is the overage number per time- unit or requests full) I led 
using a cached copy whose' corresponding original Object has changed for more than 
the given lime W\ 

J)o is maximum delay, which is ihe overage time between an object change «nd when the 
5 cached copy 5s rcfrwJtetl 

Selecting and Adjusting TTL 
A simplo mode},, which takes no other parameter than Ihe average ihne between changes (i) 
of the object is the exponential decreose model - that is, (he probability that the object docs 
10 not change during a time period of length / ll considered 10 be of the form c *' T . Jl is, of 
course, I for t m 0, and tends to 0 as t tend* to + (oo) 

Considering a situation with many wser requests, as compared to the refresh rote of the cached 
copy, so Ihe cache will update at regular intervals, despite being user-driven. 

15 

Avyiv fce , frequency of changes 

The model's consistency is now checked, especially whether the average update frequency of 
o given object is V t. Let p', (0 be Ihe probability lhai k changes over a lime period off. 
By construction: 

20 

And, for A 0, p\ = [ p[. t m - , * - [ FiM^f* 



(the probability thai k changes occur over [0:1 J is the sum over » of Ihe 
23 probability that * - 1 changes occurred between 0 and «, that there was a 

cliarige between « and «+i!h, nnd none berween w+dw and r). 



One can deduce by recurrence that: t 




43 
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Now the average number of changes over o time period of / is (by definition of ovcraging): 



The error probability is defined as being the percentage of cases where the user receives the 
cached copy whereas the object has changed on the origins] server, imd ihe cached copy is 
io outlined by more than a given time W. Figure 3 illustrates ihe percentage error (vertical 
axis, in percentages) versus time (horizontal axis, to seconds) for instance, with W-l 5s, t - 
60s, end r- 30s. The graph illustrates tho probability that the cached copy is "stole" by more 
than >P seconds. In tho graph. Wis 13 seconds, so the probability is zero between ] and 15 
seconds. Then, ihe probability rises until 30 seconds, vrhen u rc-fcfch occur* [T- 30*), os 
J> illustrated buy the sawtooth pattern 302. This pattern repeats itself every 30 seconds. 
Between 30 and 45 seconds, the probability is again zero, and a re-retch occuis at 45 seconds, 
and tho probability rises until 60 seconds, as illustrated by the sawtooth pattern 304. A 
similar revolt is shown by Ihe sawtooth pattern 306 between 75 and 90 seconds. 

20 Given that tho cache is supposed to update the copy of the object at regular intervals, the 
calculations can be jjerformcd over a time interval (0;rj. 

The average time E within ihe interval during which the cunt eat is uotdaled L» the sum over 
all intervals [tr, u+d/i] of the probability that there was no change between 0 and u, but thai 
25 there ww a change between u and « hl», multiplied by the length of time during which the 
content remains outdated hy more than W. That is: 




t 
j 



5 



Hence, the changes do have an average frequency of 1/ 1. 



Average error probability 
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£-(r->f)-T(1-e <■ ) 



or, wrUi « = 



From ibis, the error probability can be deduced as: 

Par y^-ltO 



Easicr-lo-handlc bimoib can be cak-utetctl by letting: jr/ a > = £_ + „ _ » 4 c -« 



Its derivatives are: F" (e) = 



F"'(a)=l-e- 

Since F(0) - FtO) = F"(0) - 0, find F~(a) is posilivc for any a >= 9, H(U F(0) 
ultimately, l : (U) are os wcH. So: 
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There arc better, wsy-to-find bounds for tbe case of ti>=l , but ilial is no| an interesting case 
(T-W t). Hence. 
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Maximal TTL to <twure a given trror proWbilitY 

/\u upper bound should be set on TTL which stilt ensures that the enw probability is below a 
given threshold p«. 



Par $ pQ <=™(a-1+OS/> t 



V 
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Therefore, an acceptable TTL is: 



MwimuiTTLtQa 

10 Thi J is the sure as for (he error probability, replacing p^byn^/n tsr. 



< 




n 






i 


I- 


W 












\ n i 





?S Increfore, nn ncceptablc TTL is; 



35 Average delay fur at: 

It is useful to consider the average delay f> between (he cached copy and the original object 
in the case where the latter has changed during n given interval. First, calculate 
which is the average delay knowing that the change oocnrrr d at a time « between 0 and T. 

1 



< 6t(u) > - — [ <M V - «) 



2T 
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From this, ibe overall tMay <A/> can be tteduceJ by uifegniting <&f{u)> iim« the probability 
tbal <t cbfmfce occurred between a twd u ■» d ». uvtr [0;7]: 



* ■ r 2T 

410-^-^-0) 



25 Since the <Al> ejaculation does not take bio Recount that there has been o ehangs within 
[0;7], il should be divided by the corresponding probability to get Ot 
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•ndsince: £L_£_ + a - l+e-*0 

6 2 



and: ^-a+l-SZO 

3 



Thai: 



D S 1 -^T 



Tn erder to limitJE? to .Dp, a TTL sftoukS be chew., such ay: 
2r 

7* r 

6 * 0 2t 



►6* 



i x 
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From the abwe, the (cHowing tables of TIT. [T ($)) versus a uurobtx uf crrw probabilities, 
fw various values of t and fPcan be derived; 



10 
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From the above, the following conclusions can generally be made. 

10 the probability of the retrieved information being "stale" (older than W) is essentially zero 
fox values of T(s) (Time to live) less Ihuo or equal to W and increases with incre;uaiig TT>) 
according to a cfecnying exponential, approaching 1 00% probability of error at infinity. This 
observation that U>e error probability is ?rro for values of T(s) less than or equal to W is 
esseutfaliy * "trivial" result, since it is clear that no information can he older than W if it is 

15 updated more frequently than W. Cfenrly, however, the average update interval x has ,i 
significant effect on how steeply the error probability climbs for values of T(s) grcutcr than 
W. llw greater the average update interval T with respect to \V. ibe less sharply the error 
probability rises with increasing values of T(s). 

20 Figure 4 is a graph 400 illustrating the evolution of error probability p m (vertical axis, 
expressed as a percentage (%)), as a function of the TTL (in seconds (s)) for various values of 
x (If His). The liiw 402 is for x = 20s, the line 404 is for x = 60s. the line 406 is for T = 
300s, and the line 408 is for x - 600s. 

25 A graphical illustration of a technique for choosing a TfL .to maintain error probability 
belvw a threshold value obtained by identifying the value of g^on flic vertical axis of the 
graph 400 and following an imaginary line horizontally across the graph 400 to when: it 
intersects the curve (402,404,406, or 408) for the appropriate value oft. By way of example, 
if an error probability threshold value of 10% is chosen, then the TTL for x » 20* (lint 402) 

5ft is a Utile urxler 30 seconds, for x - 60s (line 404) it is about 35 seconds, for T - 300s (line 
406) it is about 85 seconds, and for x - 600s (line 408) it is about 150 seconds. 
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By deploying caches for italic and dynamic content at nppropriale locations in the Internet, 
and by selecting appropriate update characteristics for cached dynamic content as described 
hereinabove, effective user response tinges to both dynamic and static content can be reduced 
while simultaneously reducing congestion on the external network (i.e., the Internet). 

5 

The first time on end war (110) receives o web pnge» the web page may optionally plant a 
cookie in the user's browser. 'Ihercafler, every Ume the end user accesses the web page, the 
browser sends the cookie along with Uie access request. The cookie lolls (he server (104) 
what the server wonts (or need?) to know about the end user, in addition to simply the fnct 
10 that the user wonts to retrieve the web page. Kor example, if ibe web page is a personalized 
web page of n single end user, the server knows the end user's preferences. 

According to u feature of the invention, the cucbe (108) can look at the cookie and see if the 
requested web page is a personalized page of a single end user, oi if the cookie indicates the 
IS doc ess request is coming from a global cixl user. If the requested web page is a personalized 
web page for a single cod user (or if there is some other indication thai the web page is 
supposed to be modified each time that it is wcesscd), then the web page should not be 
uichcd. This can be accomplished by setting TI L = 0. Else (eg., global end user), TTL « 
si mply set as described hereinabove, 

70 

Although the invention has been described with respect to a I muted number of embodrmcnls, 
it will be appreciated thai many variations, modifications and other applications of the 
invention may be made, end are intended to be within the scope of the invention, as dbckEwJ 
herein. 

25 
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CLAIMS 

What is claimed is: 

1 . System for caching objects laving dynornic content on ibe Internet, comprising: 

a cuobc connected in the Interne! for storing and updating copies uf dynamic content. 

2. System, according 1 0 claim t , wherein: 

(he cache is disposed at a location selected from the group consisting of major Internet 
switching locations, dial-in aggjegation points, and corporate gateways. . 

3. System, according to claim ) , wl»erein: 

the cache also stores and updates copies of static content. 

4. System, ncconling to claim 1 , further comprising: 

means for monitoring one or more of the objects to determine update characteristics 
thereof; and 

means tor adjusting a lime to live (TTL) parameter for the objects based upon tl>c 
update cbaractcristics. 

5. System, according to claim 4, further comprising: 
means for determining an age of an object; and 

means for updating the object if the TTi. for the object is less than it$ age. 

6. System, according to clitim 5, wherein: 

the TIL for c»ch object is calculated ttcconling lo an average time of changes for the 

object. 

7. System, according to claim 4, furttuff comprising: 

means for determining a probability of error for each of the objects; and 
means for adjusting the TTL for each of the objects lo maintain its probability of 
error below a predetermined error probabib'ty threshold. 

8. System, according to claim 7, wherein: 

the TTL for each object is calculated according to (he equation: 



wherein: 

T is the Time To Live, for the dynamic object; 

W is a given time since the original object has changed (i.e., how long it is 
outdated); 

t is an average time between changes, which, is determined from the object's 
change statistics; and 

Po is maximum error probability, which is the average ratio of ilw number of 

requests fulfilled using a cached copy whose corresponding original abject has 
changed for more tb*n die given lime »f, over the lulul number of requests. 
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9. System, uccording to claim A , further comprising: 

means for determining an error role for each of the object and 
means for adjusting TfL for each of (he objects to maintain its error rale below a 
predetermined error probability threshold. 

1 0. System according to claim 9, wherein: 

the TTL for each object is calculated according to tbe equation: 



wherein: 

T is Oie Time To Live for the dynamic object; 

W is o given ti Die since the original object has changed (ic, bow long it h 
outdated); 

T is no overage lime between changes, which is determined from the object's 
change statist ics; 

n is number of user requests per time unit (e.g., frequency); and 

n 9 is maximum cnor rate, which is the overage number per lime unit of requests 

fuillllcd using a cached copy whose corresponding original object has changed 

for more than the given time W, 

\ 1 . System, according In claim 4. further comprising: 

means for determining o delay lime for each of ibe objects; and 

means for adjusting TTL for each of the objects to maintain its delay time below a 

predetermined delay threshold. 

1 2. System iiccording to claim 1 1 , wherein: 

the TTL for each objccl is calculated according to the equation: 



wherein: 

T is the Time To Live for the dynamic objccl; 

t is an average time between changes, which is determined from the object's 
change statistics; and 

Do is maximum delay, which is the average time between an object change and 
when Ibe cached copy is refreshed. 

J 3. System, according to claim 4, further comprising: 

means fm determining a1 least one object characteristic selected from tbe group 
consisting of error probability, error rate and delay time for each of the objects; and 

means for adjusting Til to maintain the selected objeel charoelertics befow a 
respective threshold value. 
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1 4. System, according to claim 1 3, further comprising: 

means for limiting adjustment of TTt for each of the objects to a range bounded by 
predetcnuincd minimum (7Vnin) and muximum (7'fruw)va!nes for'iTL. 

15. Method of responding to a user request for information having dynamic contend 
comprising: 

storing □ o>py nf tbc dynamic content in a cache; 
establishing a time to live (TTL) lor the dynamic content; 
receiving the user requesl at the cache; 

responding to the user request with the stored copy of the dynamic content if its Tl L 
is greater than its age; and 

retrieving an Mpdatcd copy of the dynamic content and responding to the user request 
with the updated copy if the TTL of the stored copy is less than its age. 

16. Method, according to claim 15, further comprising: 
storing a copy of static content In the cache. 

17. ■ Method, according to claim 15, further comprising: 

determining on average update frequency for the dymunk content; rrad 
detcnairong the TTL for the dynamic content as ci function of its overage update 
frequency. 

1 8. Method, according to claim I S, further comprising; 

C^errnining an average rtpdate frequency for (he dynamic content; and 
determining the TTL for the dynamic content a* a Junction of its average update 
frequency and a predetermined error probability threshold. 

] 9. Method, according to claim 18, further wunprising: 

adjusting the TTL for the dynamic content according to a frequency of user requests 
for the dynamic content. 

20. Method, according to claim 1 5, wherein: 

the TTL for each object is calculated according to the equation: 




wherein: 

T is the Time To Live, for the dynamic object; 

W is a kivto lime strxc the original object has changed (i.e.. ho* long it is 
outdated); 

T is on overage time between changes, which is determined frum the object's 
change statistics; nnd 

po is maximum error probability, which is the average ratio of the number of 

requests fulfil led using a cached copy whose corresponding original object has 
changed far more than the given time V, over the lota) number or" requests. 



21. 



Method, according to claim 15. wherein: 
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the Tit for each object » adcnl»ied according to the equatioiv: 











f- 


w 








"1 
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wherein: 

T is the Time To Live fur the dynamic object; 

W is a given time since the oritpniil objecl has changed (i.e., how Jons it is 
outdiitcd); 

T is an average rin>c between changes, which is determined from the object's 
change statistics; 

n is nomher of user requests per time unit (e.g., frequency); nnd 

is maximum error mtc, which is Ihe average number per time unit of requests 
fulfilled using o cached copy whose corresponding original object has changed 
tor more than the given time >Y. 



Method, ttccorUing lo claim 1 5, wherein: 

the TTL for each objecl is calculated according to the equation: 



0D o 



1+3 



whejx^n: 

T is Ihe Time To Li ve for the dynamic object; 

t is an average time between changes, which is determined from Ihe object's 

. change statistics; and 
Do is maximum delay, which is the average time between an object change and 
when the cached copy is refreshed. 

23. Method, according to claim 1 5, wherein the infbnnation is represented as » web page, 
(be method further comprising: 

a first time Ihe user receives the web page, Ihe web j»ge plants a cookie in the user's 



24. Method, according to claim 23, wherein: 

every subsequent time tho end user requests the web page, the browser sends the 

cookie. 

25. Method, according to claim 24, further comprising: 

the cache looks at the cookie and sec if the requested web page is a personalized page 
of a single end user. 



26. Method, according to chum 23. further comprising: 
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if the requested web page b a personalized wtb page for a single end user, then the 
web page is not cached, 

27. Method, according to claim \ 5, further comprising: 

if ihc infonnntitm w supposed to be modified each time it is accessed, setting TTL «• 

0. 



28. Method of responding to a user request for on object having dynamic content, said 
object originating from a server, comprising: 

storing a copy of the object in a cache; 
establishing a time to live (TTL) for the object; 
receiving the user request at the- cache; 

fulfilling the user request with the stored copy of Ihc object if its TTL is greater than 
its age; and 

fetching as updated copy of the object and responding to (he uses request with (he 
updated copy if the TTL of the stored copy is less than its age- 

29. Method, according to claim 28. further comprising, in the cache: 

first setting the TTL for the object to a reasonable lower limit (Ttnin); and 
adjusting the TTL for the object based on the frequency at which the object actually 
changes 

30. Method, according to claim 28. further comprising: 

each time the cache fetches (he object from the server, performing the following 
procedures: 

a. if another fetch for the same object is ongoing, waiting for the previous fetch to 
complete; 

b. fetching the object from the server, 

c replacing the cached copy, if present, by the fetched object, aflcr having compared 
them to determine whether the objecj had changed since it was last fetched; 
d. initializing or updating the object's change statistics accordingly; 

c. marking the object as static or dynamic content depending on the server's reply; and 
f. if the object is dynamic, setting its TTL (7) to an appropriate value with respect to 

an average time between changes (t) determined from the object's change statistics, the 
number of user requests per time unit (n) determined from the objects access statistics, and 
one of the following procedures (A-E): 

A. maximum error probability; 

U. maximum error rate; 
' C. unuimuno delay, 

D. any combination of the above (A r B r Q, taking the lowest T\ 01 

E, any combination of the above <A,D,C), bin keeping T within a predetermined 
window of "reasonable" values bounded by Train and Tmax. 

3 1 . Method, according to claim 28, wherein the information is represented as a web page, 
the method further comprising: 

a first tiioc the user receives the web page, the web page plants a cookie in the user's 
browser. 



32. Method, according to clBim 3 1 , wherein: 
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every subsequeni time the end user requests the web page, the browser sends the 

cookie 

33. Method. According to claim 32, farther comprising: 

the cache looks at the cookie mid sec iftlw requested web page is n personalized page 
of n siogle end user. 

34. Method, according to claim 31. further comprising; 

if the requested web page is a personalized web page for a single end user, then the 
web pafte is oot cached. 

35. Method, according to claim 2R, further comprising: 

if the information is supposed to be modif>ed each time it is accessed, setting TI L - 0. 
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